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Web Results 1 - 10 of about 82,700 for searching internet usage analyzer indexing data files folders image 

Tip: Looking for pictures? Try Google Images 

Mark's Sysinternals Blog: Sony. Rootkits and Digital Rights ... 

... her data files on a separate partition and I'm gonna create an image of her c: 

... has a great AUDIO file discussing over this mess now (well done MARK! ... 

www.sysintemals.com/blog/2005/ 10/sony-rootkits-and-digital-rights.html - 513k - Cached - Similar pages 



[pdf] Guidelines on PDA Forensics 

File Format: PDF/Adobe Acrobat - View as HTML 

String searches, keyword searches, and text string searches. Internet-related 
evidence, such as Web site traffic analysis, chat logs, cache, files, e-mail ... 
csrc.nist.gov/publications/nistpubs/800-72/sp800-72.pdf - Similar pages 



internet browser Software. Freeware, Shareware. Downloads 
Designed to search files for e-mail addresses, Email Logger utilizes ... 
Small Internet Browser for Advanced Image Search with Google search engine. ... 
www.ttuga.com/software/27/internet-browser.html - 62k - Cached - Similar pages 



Tortuga Software Downloads 

synchronize clock • synchronize files ■ synchronize folders ... unzip files • 
upc codes - update tool - upload image * ups online • usage analysis ... 

www.ttuga.com/ - 289k - Cached - Similar pa ges 



Utility 

Use to manage all nonpermanent disks data; view archives contents (up to 6 formats 
including self-extracting files); view files, folders, packed files and ... 

www.window95.com/windows/utility.html - 53k - Cached - Similar pa ges 

download directory, download guide Bizeurope.com 
View several folders at once containing any number of image files. ... Internet Trail 
Remover secures your privacy by searching and removing trails that ... 
www.bizeurope.com/downloads.htm - 178k - Cached - Similar pages 

Download Communications 

Simply record your message in an audio file, then use the built-in ... from virtually 
any source (address books, mail folders, databases, csv files. ... 

www.freedownloadmanager.org/downloads/27_c/index13.htm - 61k - Cached - Similar pa ges 

Office XP Professional Software Training CD-ROMs 
Section A: Introduction Open Word Open Dialog Box Files & Folders Open Documents 
Copy and Paste ... Fax: +44 (0) 121 248-2800 Email: products@cvision.co.uk. 
www.cvision.co.uk/cd/officexpprof. htm - 23k - Cached - Similar pages 



Microsoft Mail: Introduction to Messaging Standards 

'Microsoft Site Server 3.0 Search: Capacity and Performance Analysis ... jpeg, 

gif (still image data). Audio, audio or voice data. Video, mpeg. Application ... 

www.microsoft.com/technet/archive/mail/appndix1 .mspx - 205k - Cached - Similar pa ges 
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SOFTWARE - Biznetmall - Internet Sales 

Backs up favorite files/folders. Also creates highly customized self-extrac ... 
image grabber pro is the fastest and easiest way to search the Internet for ... 

www.biznetl .com/bm/software/ - 167k - Cached - Similar pages 

Google Groups results for searching internet usage analyzer indexing data files folders 
images audio files appointments email 

^^-v NewestShareware.com Issue #215 - alt.comp.shareware - Aug 21 , 2003 
^Cf IIxUA/EAeiUfA /EAE1M1 ... - relcom.archives - Jun 21, 1996 

HoBbie cftaMJibi Ha cfraftnoBOM ... - relcom.archives - Jun 10, 1996 

Try searching for searching internet usage analyzer indexing data files folders images 
audio files appointments email on Google Book Search 
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Web Results 1-10 of about 10,300,000 for automatically selecting data sources indexing . (1.11 seconds) 

IBM Tivoli Storage Manager for AIX: Administrator's Guide - Contents 
Moving Data from an Offsite Volume in a Copy Storage Pool * Procedure for Moving 
Data ... Setting Up Source and Target Servers for Virtual Volumes ... 

publib.boulder.ibm.com/infocenter/ tivihelp/v1r1/topic/com.ibm.itsmaixn.doc/anragd5302.htm - 124k - 
Cached - Similar pages 

IBM Tivoli Storage Manager for Macintosh: Backup-Archive Clients ... 

Other sources of online help * Backing up your data ... Encrypting data during 

backup or archive operation ... Selecting a management class for folders ... 

publib.boulder.ibm.com/infocenter/ tivihelp/v1M /topic/com. ibm.itsmc.doc/ans1 000002. htm - 47k - 

Cached - Similar pages 

f More results from publib.boulder.ibm.com ] 

HTML Techniques for Web Content Accessibility Guidelines 1.0 
Future browsers and assistive technologies will be able to automatically translate 
tables into linear sequences or navigate a table cell by cell if data is ... 
www.w3.org/TR/WCAG10-HTML-TECHS/ - 176k - Cached - Similar p ages 



Integrating Diverse Data Sources with Gadfly 2 

Integrating Diverse Data Sources with Gadfly 2 ... drinkers who frequent bars (this 

is a comment) select * from frequents ... 

www.python.org/workshops/2000-01/ proceedings/papers/watters/watters.html - 33k - Cached - Similar pages 
Indexing ActiveX Data Sources 

Indexing ActiveX Data Sources. An IndexJob provides two ways specify the text you 
... The example assumes that the RecordSet will be created using a SELECT ... 

support.dtsearch.com/webhelp/ dtengine/indexing_activex_data_sources.htm - 20k - Cached - Similar pages 
Working with Data Sources 

In the ColdFusion Administrator, under Data Sources, select the ODBC or native 
drivers ... Enter a name for the data source, select the appropriate driver, ... 

livedocs.macromedia.com/coldfusion/ 5.0/Using_ColdFusion_Studio/data3.htm - 13k - Cached - Similar pages 



Directions for Importing References/Citations 

If RefWorks does not automatically open, select "Click here to access your RefWorks 
... Select CAS SciFinder from Import Filter/Data Source dropdown menu ... 
www.lib.uchicago.edu/e/using/ bibtools/refworks/addingcites.html - 21k - Cached - Similar pages 

The Open University Library 

Within RefWorks select 'Import' from the 'References' drop down menu, select 'EDINA' 

from 'Import Filter/Data Source' and 'Index to the Times' from the ... 

Hbran/.open. ac.uk/help/refvvorksheip. html - 28k - Dec 3, 2005 - Cached - Si miiar pag es 



RefWorks Import Help @ the Libraries 

... Social Sciences, or Arts & Humanities Citation Index. Select records and mark 
... Use Import Filter/Data Source labeled Innovative Interfaces (INNOPAC) ... 
www.libraries.wright.edu/ services/retworks/importhelp.html - 26k - Cached - Similar pag es 
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UsingApt - documentation - tinysofa enterprise server 

update is used to resynchronize the package index files from their sources. ... 

For example, issuing apt-get install VsFtpD will automatically select the ... 

www.tinysofa.org/documentation/index.cgi7UsingApt - 13k - Dec 3, 2005 - Cached - Similar pages 

Try searching for automatically selecting data sources indexing on Google Book Search 
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Web Results 1 - 10 of about 2,440,000 for searching sources sparse representation data sources . (0.51 se 

Analysis of Sparse Representation and Blind Source Separation ... 
Analysis of Sparse Representation and Blind Source Separation. Yuanging Li ... 
First, sparse representation (factorization) of a data matrix is discussed. ... 
neco.mitpress.org/cgi/content/abstract/1 6/6/1 193 - Similar pages 

Sparse multicast - Wikipedia, the free encyclopedia 
Sparse mode multicast is one mode which multicast can use to construct a tree for 
... When a data source first sends to a group, its DR unicasts Register ... 
en.wikipedia.org/wiki/Sparse_multicast - 14k - Cached - Similar pages 

rPDFi SUPERRESOLUTION SOURCE LOCALIZATION THROUGH DATA-ADAPTIVE ... 

File Format: PDF/Adobe Acrobat - View as HTML 

If the number of sources. A. A. , sensor, data can be represented by a linear 
... number of sources with unique sparse representation in terms of an ... 
ssg.mit.edu/-dmm/publications/malioutov_SAM02.pdf - Similar pa ges 

Talk Abstracts: Digital Libraries: Data Modeling and Representation 

Traditional data compression algorithms or source codes were developed for ... 
We build an image search engine which retrieves images of a person, ... 
www.ima.umn.edu/multimedia/abstract/1-29abs.html - 49k - Cached - Similar pages 



Open Directory - Computers: Programming: Languages: Fortran ... 
Fortran Library Links - Gary Scott's extensive collection of source code links. 
... calls to the XDR (external Data Representation) routines from Fortran. ... 
dmo2.org/Computers/Programming/ Languages/Fortran/Source_Code/ - 41k - Cached - Similar pa ges 

How Database Snapshots Work 

Sparse files are a feature of the NTFS file system. As data is written to a sparse 
... The only exception is when the source database uses full-text search, ... 
msdn2.microsoft.com/en-us/library/ms187054.aspx - 27k - Cached - Similar pages 

Dynamic Brain Sources of Visual Evoked Responses - Makeig et al .„ 
Independent component analysis applied to the single-trial data identified at 
least eight ... Analysis of Sparse Representation and Blind Source Separation. ... 
www.sciencemag.org/cgi/content/abstract/295/5555/690 - Similar pages 

FROM THE WEB TO THE GLOBAL INFOBASE 
Approximate Caching for Continuous Queries over Distributed Data Sources . ... 
Search Over New Sources: We have been integrating new classes of information ... 
www-db.stanford.edu/-manning/GIBreport2002.html - 24k - Cached - Similar pa ges 

[ppt] CS267: Sources of Parallelism and Locality 
File Format: Microsoft Powerpoint 97 - View as HTML 

Sparse matrix(A). =. Improves cache reuse of source vector. Challenge: choosing 
a block size ... Sparse matrix is a representation of a (sparse) graph ... 
www.cs.berkeley.edu/-yelick/ cs267-sp04/lectures/16/lect16-sparse.ppt - Si milar pages 
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Mines and Minerals Division 

Southern Ontario has sparse representation as it was designated as the last area 
to be ... Source data is current to the time of the exploration survey. ... 

www.mndm.gov.on.ca/mndm/ mines/ermes/databases/drilmeta_e.asp - 29k - Cached - Similar pages 

Try searching for searching sources sparse representation data sources on Google Book 
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Web Results 1-10 of about 531,000 for filtering data source create sparse representation of data source. 

Scholarly articles for filtering data source create sparse representation of data source 

D Probabilistic Models for Unified Collaborative and ... - by Popescul - 57 citations 
Discovering Internet Marketing Intelligence through ... - by Buechner - 121 citations 
Organization of heterogeneous scientific data using the ... - by Nadkarni - 44 citations 

rPDFi DENOISING SOURCE SEPARATION: A NOVEL APPROACH TO 1CA AND FEATURE 

File Format: PDF/Adobe Acrobat - View as HTML 

The first source seem to vary more slowly than the other, one. Hence, a reasonable 
denoising scheme would be to. low-pass filter the data. ... 

www.cis.hut.fi/jaakkos/papers/sarela05clw_abs.pdf - Similar pages 

[pdf] Data Management Framework V2.1 
File Format: PDF/Adobe Acrobat - View as HTML 

This data source, allows data tables to be imported or created in memory, modified, 
... DMF Data Filters represent dynamic subsets of a DataView object. ... 
www.ewasystems.com/JavaDoc/EWA_DMF%20Architecture.pdf - Similar pa ges 

DBMS - September 1996 - Prospero 1.1 

The actual data is represented in a data source as a port. ... Prospero includes 
a Basic compiler that lets you create your own building blocks through ... 
www.dbmsmag.com/9609d08.html - 14k - Cached - Similar p ages 

rppn Project i-MARQ: Application of Data Fusion Techniques within ... 
File Format: PDF/Adobe Acrobat - View as HTML 

Such networks can be used to create a spatially rich data ... data/uncertainty 
representation for a variety of data sources. ... 

wwwJmarq.info/documents/iMARQ-ISESS%202003%20paper.pdf - Similar pa ges 

xml-dev - ACM Queue Special Issue on Semi-Structured Data 
... integration projects that provide searching over a federation of data sources, 
... XML provides a tool for representing and grappling with the data and ... 

lists.xmLorg/archives/xml-dev/20051 1/msg00076.html - 14k - Cached - Similar pages 

Frequently Asked Questions about OLAP and Microsoft Analysis ... 
Source Database: In data warehousing, the database from which data is extracted 
for use in the data warehouse. Sparsity: The relative percentage of a ... 
msdn.microsoft.com/library/ en-us/dnmda/html/odc_plapfaq.asp - 49k - Cached - Similar pa g es 



rppn INTEGRATION AND FILTERING OF 3D SPATIAL DATA USING A SURFACE ... 



File Format: PDF/Adobe Acrobat 

the means of creating a homogenous but filtered data set. That, is, firstly 
sources of differences between the source files need to ... 
www.isprs.org/commission4/proceedings/pdfpapers/432.pdf - Similar pa ges 

[ppt] www.statTutgers.edu/-madigan/mms/MMSpresentation9... 
File Format: Microsoft Powerpoint 97 - View as HTML 

Supervised Filtering. 6. Creating summary statistics on massive data streams ... 
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Multidimensional data: source, destination, time sent or received, ... 
Similar pages 

Section 18: Scientific Visualization 

data readers - input the data from the data source; data filters - convert ... 
that combines efficient volume projection with a sparse data representation. ... 

accad.osu.edu/-waynec/history/lesson18.html - 49k - Cached - Similar pages 

Glossary 

Data Scrubbing: The process of filtering, merging, decoding, and translating 
source data to create validated data for the data warehouse. ... 

www.dmreview.com/resources/glossary.cfm - 101k - Dec 3, 2005 - Cached - Similar pag es 

Try searching for filtering data source create sparse representation of data source on 

Google Book Search 
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Web Results 1 - 10 of about 8,090,000 for monitoring users automatically analyzing data sources . (0.26 se 

Windows AntiSpyware (Beta): Analysis approach and categories 

Monitoring programs, or software designed to monitor user activity, ... 

Microsoft examines a broad range of new and emerging data sources in analyzing and ... 

www.microsoft.com/athome/security/ spyware/software/isv/analysis.mspx - 48k - Cached - Similar pages 

Microsoft Reports Platform for End-User Reporting and Analysis 
This allows users to quickly consume and analyze data using familiar tools and 
interfaces without understanding the data sources and systems. ... 
www.microsoft.com/technet/ itsolutions/msit/busint/msreportsTCS.mspx - 59k - 
Cached - Similar pa ges 

Web Design Monitoring Reporting Mining Log 

(The Data Warehousing Information Center)- The topic of analyzing web data (also 
... Also, news sources will also be watched and summarized within a ... 

webdesign.ittoolbox.com/topics/ t.asp?t=417&p=417&h1=417 - 32k - Dec 3, 2005 - Cached - Similar pages 
Qvo Studios: Tape-Free Usability Labs 

For each user, Ovo Logger automatically tracks both time-on-task and the ... 
Analyzing User Data back to top. Search and Filter User Data Quickly and ... 
www.ovostudios.com/ovologger.asp - 21k - Cached - Similar pa ges 



What's New: Products 

Support for Continuous Real-time Decode, Allows users to continuously monitor 
and analyze a specific data stream without storage constraints ... 
www.netscout.com/products/whatsnew.asp - 54k - Cached - Similar pa ges 

STATISTICA Enterprise-Wide Data Analysis System (SEDAS) 
Automatic data monitoring/analysis; analytic auto-responding; ... power to define 
the specific permissions of users, the queries to external data sources, ... 

www.statsoft.com/products/sedas.html - 16k - Cached - Similar pages 



ReportWriting.com : Reporting and Analysis 
Create complex, multi-page layouts using different data sources without ... 
Using this intuitive HTML-only web solution, users access, analyze and share ... 
www.reportwriting.com/reportingandanalysis.asp - 39k - Cached - Similar pages 



Analyzing Requirements and Defining Microsoft.NET Solution ... 
SQLServer 2000 has two utilities for data transfer between data sources: ... 
An outline of the application monitoring process:. Analysis - what do you want? ... 
www.dotnetjohn.com/articles.aspx?articleid=195 - 32k - Cached - Similar pages 



Watershed/Water Quality Technical Support Center WCS Page, US EPA 
WCS provides users an initial set of watershed data along with analysis and ... 
for monitoring and intensive surveys, for nonpoint source assessments to ... 
www.epa.gov/athens/wwqtsc/html/wcs.html - 27k - Cached - Similar pa ges 

Syntricity: dataConductor & reportConductor- Web-based ... 
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Automated analysis flows within dataConductorEP enable users to automatically 
collect data and monitor yields, allowing them to focus on increasing yields. ... 
www.syntricity.com/Products/Overview.htm - 38k - Cached - Similar pages 

Try searching for monitoring users automatically analyzing data sources on Google Book 
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SAS | SAS Data Quality Solution 

By integrating data quality automatically into the ETL and data ... A Windows-friendly 
environment lets users analyze data, define business rules and create ... 
www.sas.com/technologies/dw/etl/dqcleanse/ - 32k - Cached - Similar pag es 

Miscellaneous software to monitor networks, servers, workstations ... 
Monitor your sites or servers and record any downtime for later analysis. ... 
data to file . automatic, manual mode . free data source . serial device ... 
www.monitortools.com/misc/ - 79k - Cached - Similar pages 

SEWASIE Prototype 

Monitoring is done either for a query that has been defined by the user, ... 

In general, the mapping from the data sources to the BA ontology is not simply ... 

www.sewasie.org/sewasie-prototype.htm - 26k - Cached - Similar pa ges 

RBNB WEB WHITE PAPER 

Built-in or user-supplied time-stamping for synchronization and cross-analysis. 
Scalable and Extendible. Data sources and monitoring stations located ... 
outlet.creare.com/rbnb/WP/WebWP/rbnbwp.html - 12k - Cached - Similar p ages 

Data Collection and Analysis 

Some data sources are restricted to University of Virginia affiliates where as 
... Allows a user to record audio interviews and analyze qualitative data. ... 
lrs.ed.uiuc.edu/tse-portal/ datacollectionmethodologies/j-mcmillan/datacollection.html - 16k - 
Cached - Similar pa ges 



Welcome to Inxight Software, Inc. 

Create link analysis and business intelligence applications that monitor ... 
ThingFinder Advanced can help users automatically extract and discover all ... 
www.inxight.com/products/sdks/tf/ - 23k - Cached - Similar pa ges 

Quality Magazine: Mining Factory Data 

Statserver is a Web-based system that uses Insightful's S-Plus software for data 
analysis, data mining and statistical modeling. It enables users to deploy ... 

www.qualitymag.com/CDA/Articlelnformation/ coverstory/BNPCoverStoryltem/0,6424 f 98995 J 00.html - 40k - Dec 
4, 2005 - Cached - Similar pa ges 



Silvaco - Products -SPAYN 

The data is then automatically grouped so that each parameter in a specific group 
is controlled by the same source of variation. Analysis of each parameter ... 
www.silvaco.com/products/ analog/vyper/spayn/spayn_datasheet.html - 25k - Cache d - Simi lar p a ges 

Automatised Night-time Scan Failure Alarm for Your Guest House ... 
Effective automatic monitoring can be set up easily, letting you sleep well ... 
To analyze the data automatically, we need a data analysis software that can ... 
www.esrf.fr/UsersAndScience/users_org/FailureAlarm - 20k - Dec 3, 2005 - Cached - Similar pages 
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Communications Software Monitoring | Orchestria.com ... 
Robust Message Analysis, Automatically extract all information ... Lookup, 
Reference external data sources and/or user and message attributes ... 
www.orchestria.com/products/features-8i-benefits/ - 147k - Cached - Similar pages 
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Windows AntiSpyware (Beta): Analysis approach and categories 

Monitoring programs, or software designed to monitor user activity, ... 

Microsoft examines a broad range of new and emerging data sources in analyzing and ... 

www.microsoft.com/athome/security/ spyware/software/isv/analysis.mspx - 48k - Cached - Similar pages 



Microsoft Reports Platform for End-User Reporting and Analysis 
This allows users to quickly consume and analyze data using familiar tools and 
interfaces without understanding the data sources and systems. ... 
www.microsoft.com/technet/ itsolutions/msit/busint/msreportsTCS.mspx - 59k - 
Cached - Similar pag es 

Web Design Monitoring Reporting Mining Log 

(The Data Warehousing Information Center)- The topic of analyzing web data (also 
... Also, news sources will also be watched and summarized within a ... 

webdesign.ittoolbox.com/topics/ t.asp?t=417&p=417&h1=417 - 32k - Dec 3, 2005 - Cached - Similar pag es 
Ovo Studios: Tape-Free Usability Labs 

For each user, Ovo Logger automatically tracks both time-on-task and the ... 
Analyzing User Data back to top. Search and Filter User Data Quickly and ... 
www.ovostudios.com/ovologger.asp - 21k - Cached - Similar pag es 



What's New: Products 

Support for Continuous Real-time Decode, Allows users to continuously monitor 
and analyze a specific data stream without storage constraints ... 
www.netscout.com/products/whatsnew.asp - 54k - Cached - Similar pages 



STATISTICA Enterprise-Wide Data Analysis System (SEDAS) 
Automatic data monitoring/analysis; analytic auto-responding; ... power to define 
the specific permissions of users, the queries to external data sources, ... 

www.statsoft.com/products/sedas.html - 16k - Cached - Similar pages 



ReportWriting.com : Reporting and Analysis 
Create complex, multi-page layouts using different data sources without ... 
Using this intuitive HTML-only web solution, users access, analyze and share ... 
www.reportwriting.com/reportingandanalysis.asp - 39k - Cached - Similar p ages 



Analyzing Requirements and Defining Microsoft.NET Solution .„ 
SQLServer 2000 has two utilities for data transfer between data sources: ... 
An outline of the application monitoring process:. Analysis - what do you want? ... 
www.dotnetjohn.com/articles.aspx?articleid=195 - 32k - Cached - Similar pages 

Waiershed/Water Quality Technical Support Center WCS Page. US EPA 
WCS provides users an initial set of watershed data along with analysis and ... 
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Automated analysis flows within dataConductorEP enable users to automatically 
collect data and monitor yields, allowing them to focus on increasing yields. ... 
www.syntricity.com/Products/Overview.htm - 38k - Cached - Similar pages 
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1 Web mining for web personalization 
Magdalini Eirinaki, Michalis Vazirgiannis 

February 2003 ACM Transactions on Internet Technology (TOIT), Volume 3 issue l 
Publisher: ACM Press 

Full text available* fifi pdf(293 73 KB) Additional Information: full citation , abstract , references , citings , index 
' To-^ -1 5 terms , review 

Web personalization is the process of customizing a Web site to the needs of specific 
users, taking advantage of the knowledge acquired from the analysis of the user's 
navigational behavior (usage data) in correlation with other information collected in the 
Web context, namely, structure, content, and user profile data. Due to the explosive 
growth of the Web, the domain of Web personalization has gained great momentum both 
in the research and commercial areas. In this article we present a survey ... 

Keywords: WWW, Web personalization, Web usage mining, user profiling 



2 Fast detection of communication patterns in distributed executions 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Adva need 
Studies on Collaborative research 

Publisher: IBM Press 

Full text available: ^] pdf(4.21 MB) Additional Information: full citation , abstract , references, index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the 
execution of the application. The visualization tool we use is Poet, an event tracer 
developed at the University of Waterloo. However, these diagrams are often very complex 
and do not provide the user with the desired overview of the application. In our 
experience, such tools display repeated occurrences of non-trivial commun ... 

3 To pical locality in the Web 
Brian D. Davison 

July 2000 Proceedings of the 23rd annual international ACM SIGIR conference on 
Research and development in information retrieval 

Publisher: ACM Press 

Full text available: ^ pdf(771.77 KB) Additional Information: full citation , abstract , references , citings , index 
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terms 

Most web pages are linked to others with related content. This idea, combined with 
another that says that text in, and possibly around, HTML anchors describe the pages to 
which they point, is the foundation for a usable World-Wide Web. In this paper, we 
examine to what extent these ideas hold by empirically testing whether topical locality 
mirrors spatial locality of pages on the Web. In particular, we find that the likelihood of 
linked pages having similar textual content to be ... 

4 The string B-tree: a new data structure for string search in external memory and its 
^ applications 

^ Paolo Ferragina, Roberto Grossi 

March 1999 Journal of the ACM ( JACM), Volume 46 issue 2 

Publisher: ACM Press 

Full text available- AO odf(363.37 KB) Additional Information: full citation , abstract, references , dtings, index 
^ terms 

We introduce a new text-indexing data structure, the String B-Tree, that can be seen as a 
link between some traditional external-memory and string-matching data structures. In a 
short phrase, it is a combination of B-trees and Patricia tries for internal-node indices that 
is made more effective by adding extra pointers to speed up search and update 
operations. Consequently, the String B-Tree overcomes the theoretical limitations of 
inverted files, B-trees, prefix B-trees, s ... 

Keywords: B-tree, Patricia trie, external-memory data structure, prefix and range 
search, string searching and sorting, suffix array, suffix tree, text index 



Ex periments in social data minin g : The TopicShop system 
Brian Amento, Loren Terveen, Will Hill, Deborah Hix, Robert Schulman 

March 2003 ACM Transactions on Computer-Human Interaction (TOCHI), volume 10 issue 
l 

Publisher: ACM Press 

Full text available- fiQ pdf(377.92 KB) Additlonal Information: full citation , abstract , references , citings , index 
'T^J-^— * : terms 

Social data mining systems enable people to share opinions and benefit from each other's 
experience. They do this by mining and redistributing information from computational 
records of social activity such as Usenet messages, system usage history, citations, or 
hyperlinks. Some general questions for evaluating such systems are: (1) is the extracted 
information valuable? and (2) do interfaces based on the information improve user task 
performance? We report here on TopicShop, a syst ... 

Keywords: Cocitation analysis, collaborative filtering, computer-supported cooperative 
work, information visualization, social filtering, social network analysis 



6 Industrial sessions: big data: The SPSS skyserver: public access to the sloan digital 
<|k sky server data 

^ Alexander S. Szalay, Jim Gray, Ani R. Thakar, Peter Z. Kunszt, Tanu Malik, Jordan Raddick, 
Christopher Stoughton, Jan vandenBerg 

June 2002 Proceedings of the 2CG2 ACM SIGMGD international conference on 
Management of data 

Publisher: ACM Press 

Full text available* fi3 Ddfd 48 MB) Additional Information: full citation , abstract , references , citings , index 
'™ terms 

The SkyServer provides Internet access to the public Sloan Digital Sky Survey (SDSS) 
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data for both astronomers and for science education. This paper describes the SkyServer 
goals and architecture. It also describes our experience operating the SkyServer on the 
Internet. The SDSS data is public and well-documented so it makes a good test platform 
for research on database algorithms and performance. 

Multimedia and visualization: Dynamic structuring of web information for access 
visualization 

Jess Y. S. Mak, Hong Va Leong, Alvin T. S. Chan 

March 2002 Proceedings of the 2002 ACM symposium on Applied computing 
Publisher: ACM Press 

Full text available: ^ pdf(765.23 KB) Additional Information: full citation , abstract , references , index terms 

The Internet has led to the formation of a global information infrastructure. To explore a 
web site, a site map would be useful as a short cut for a user to locate for the target 
information in a structured and efficient manner, rather than drilling into the web site 
following hyperlinks, reading possibly irrelevant information. Useless information impacts 
a mobile web environment, where mobile clients are only connected with unreliable 
wireless channels of limited bandwidth. Structured web page ... 

Keywords: DOM, VRML, XML, visualization, web document structure 



Reusable software components 
Trudy Levine 

July 1996 ACM SIGAda Ada Letters, Volume xvi issue 4 
Publisher: ACM Press 

Full text available: fi3pdf(2.45 MB) Additional Information: full citation , index terms 



The state of the art in automating usability evaluation of user interfaces 
Melody Y. Ivory, Marti A Hearst 

December 2001 ACM Computing Surveys (CSUR), Volume 33 issue 4 
Publisher: ACM Press 

Full text available- ■a P df(2.31 MB) Additional Information: full citation, abstract, references , citings, index 
Ld = H ^ " terms , review 

Usability evaluation is an increasingly important part of the user interface design process. 
However, usability evaluation can be expensive in terms of time and human resources, 
and automation is therefore a promising way to augment existing approaches. This article 
presents an extensive survey of usability evaluation methods, organized according to a 
new taxonomy that emphasizes the role of automation. The survey analyzes existing 
techniques, identifies which aspects of usability evaluation aut ... 

Keywords: Graphical user interfaces, taxonomy, usability evaluation automation, web 
interfaces 



1 ° Scalable feature selection, classification and signature generation for or g anizing 

large text databases into hierarchical topic taxonomies 

Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan 

August 1998 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 7 Issue 3 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^ g) pdf(281.37 KB) Additional Information: full citation , abstract , citings , index terms 
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We explore how to organize large text databases hierarchically by topic to aid better 
searching, browsing and filtering. Many corpora, such as internet directories, digital 
libraries, and patent databases are manually organized into topic hierarchies, also called 
taxonomies. Similar to indices for relational data, taxonomies make search and access 
more efficient. However, the exponential growth in the volume of on-line textual 
information makes it nearly impossible to maintain such taxono ... 

11 Web engineering with semantic annotation: A multilingual usage consultation tool 
based on internet searching: more than a search engine, less than QA 
Kumiko Tanaka-Ishii, Hiroshi Nakagawa 

May 2005 Proceedings of the 14th international conference on World Wide Web 
Publisher: ACM Press 

Full text available: fifl pdf(453.74 KB) Additional Information: full citation , abstract , references , index terms 



We present a usage consultation tool, based on Internet searching, for language learners. 
When a user enters a string of words for which he wants to find usages, the system sends 
this string as a query to a search engine and obtains search results about the string. The 
usages are extracted by performing statistical analysis on snippets and then fed back to 
the user.Unlike existing tools, this usage consultation tool is multi-lingual, so that usages 
can be obtained even in a language for which th ... 

Keywords: question answering, text mining, usage consultation 



12 Innovation, management & strategy: Virtual web services: application of software 
<g> agents to personalization of web services 
^ Jarogniew Rykowski, Wojciech Cellary 

March 2004 Proceedings of the 6th international conference on Electronic commerce 
ICEC04 

Publisher: ACM Press 

Full text available: ^ pdf(292.99 KB) Additional Information: full citation , abstract , references 

In this paper we propose an application of software agents to provide Virtual Web 
Services. A Virtual Web Service VWS is a linked collection of several real and/or virtual 
Web Services, and public and private agents, accessed by the user in the same way as a 
single real Web Service. A Virtual Web Service allows unrestricted comparison, 
information merging, pipelining, etc., of data coming from different sources and in 
different forms. Web Services are accessed according to t ... 

Keywords: customization, personalization, software agents, web services 



13 Ap plications: An analysis of Internet chat systems 
Christian Dewes, Arne Wichmann, Anja Feldmann 

October 2003 Proceedings of the 3rd ACM SIGCOMM conference on Internet 
measurement 

Publisher: ACM Press 

Full text available: ^ [ pdf(630.55 KB) Additional Information: full citation , references , citings , index terms 



Keywords: IRC, chat, network measurements 



14 LinkSelector: A Web mining approach to hyperlink selection for Web portals 
Xiao Fang, Olivia R. Liu Sheng 
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May 2004 ACM Transactions on Internet Technology (TOIT), Volume 4 issue 2 
Publisher: ACM Press 

Full text available: ^pdf(2.10MB) Additional Information: full citation , abstract , references , index terms 

As the size and complexity of Web sites expands dramatically, it has become increasingly 
challenging to design Web sites where Web surfers can easily find the information they 
seek. In this article, we address the design of the portal page of a Web site, which serves 
as the homepage of a Web site or a default Web portal. We define an important research 
problem— hyperlink selection: selecting from a large set of hyperlinks in a given Web site, 
a limited number of hyperlinks for inclusion in a po ... 

Keywords: Web mining 



15 Service selection and metadata: Automating metadata generation: the simp le 
<g> indexing interface 

^ Kris Cardinaels, Michael Meire, Erik Duval 

May 2005 Proceedings of the 14th international conference on World Wide Web 

Publisher: ACM Press 

Full text available: ^ pdf(302.39 KB) Additional Information: full citation , abstract , references , index terms 

In this paper, we focus on the development of a framework for automatic metadata 
generation. The first step towards this framework is the definition of an Application 
Programmer Interface (API), which we call the Simple Indexing Interface (SII). The 
second step is the definition of a framework for implementation of the SII. Both steps are 
presented in some detail in this paper. We also report on empirical evaluation of the 
metadata that the SII and supporting framework generated in a real-life c ... 

Keywords: learning objects, metadata generation 

16 Document searching, document annotation, and document metadata: Prefilterinq 
techniques for efficient XML document processing 

^ Chia-Hsin Huang, Tyng-Ruey Chuang, Hahn-Ming Lee 

November 2005 Proceedings of the 2005 ACM symposium on Document engineering 

DocEng '05 
Publisher: ACM Press 

Full text available: ^ pdf(442.96 KB) Additional Information: full citation , abstract , references , index terms 

Document Object Model (DOM) and Simple API for XML (SAX) are the two major 
programming models for XML document processing. Each, however, has its own efficiency 
limitation. DOM assumes an in-core representation of XML documents which can be 
problematic for large documents. SAX needs to scan over the document in a linear 
manner in order to locate the interesting fragments. Previously, we have used tree-to- 
table mapping and indexing techniques to help answer structural queries to large, or large 
c ... 

Keywords: DOM, SAX, prefiltering, structural query, two-phased XML processing model 



17 Searching the Web 

August 2001 ACM Transactions on Internet Technology (TOIT), volume l issue l 
Publisher: ACM Press 

Full text available* fi3 pdf(319,98 KB) Add ' tional Information: full citation , abstract , references , citings , index 
* ™ terms , review 

We offer an overview of current Web search engine design. After introducing a generic 
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search engine architecture, we examine each engine component in turn. We cover 
crawling, local Web page storage, indexing, and the use of link analysis for boosting 
search performance. The most common design and implementation techniques for each of 
these components are presented. For this presentation we draw from the literature and 
from our own experimental search engine testbed. Emphasis is on introduci ... 

Keywords: HITS, PageRank, authorities, crawling, indexing, information retrieval, link 
analysis, search engine 



18 Business-to-business interactions: issues and enabling technologies 
B. Medjahed, B. Benatallah, A. Bouguettaya, A. H. H. Ngu, A. K. Elmagarmid 
May 2003 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 12 Issue 1 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^g] pdf(558.34 KB) Additional Information: full citation , abstract, citings , index terms 

Business-to-Business (B2B) technologies pre-date the Web. They have existed for at least 
as long as the Internet. B2B applications were among the first to take advantage of 
advances in computer networking. The Electronic Data Interchange (EDI) business 
standard is an illustration of such an early adoption of the advances in computer 
networking. The ubiquity and the affordability of the Web has made it possible for the 
masses of businesses to automate their B2B interactions. However, several issu ... 

Keywords: B2B Interactions, Components, E-commerce, EDI, Web services, Workflows, 
XML 



19 Performance and cost tradeoffs in Web search I 
Nick Craswell, Francis Crimmins, David Hawking, Alistair Moffat 

January 2004 Proceedings of the fifteenth conference on Australasian database - 
Volume 27 CRPIT'04 

Publisher: Australian Computer Society, Inc. 

Full text available: ^) pdf(1 53.92 KB) Additional Information: full citation , abstract , references , citings 

Web search engines crawl the web to fetch the data that they index. In this paper we re- 
examine that need, and evaluate the network costs associated with data acquisition, and 
alternative ways in which a search service might be supported. As a concrete example, we 
make use of the Research Finder search service provided at http://rf.panopticsearch.com, 
and information derived from its crawl and query logs. Based upon an analysis of the 
Research Finder system we introduce a hybrid arrangement, in ... 

Keywords: Web crawling, World-Wide Web, information retrieval, metasearch, search 
engine 



20 An intelligent distributed environment for active learning 

August 2001 Journal on Educational Resources in Computing (JERIC) 
Publisher: ACM Press 

Full text available* odfd 16 71 KB) Ac,c,it ' ona, Information: full citation , abstract , references , citings, index 



rms, review 

Active learning is an effective learning approach. In this article we present an intelligent 
agent-assisted environment for active learning to better support the student-centered, 
selfpaced, and highly interactive learning approach. The environment uses the students 
learningrelated profile such as learning style and background knowledge in selecting, 
organizing, and presenting learning material, and it adopts a new approach to course 



http://portal.acm.org/resul^ 12/5/05 



Results (page 1): web pages searching internet usage analyzer indexing data Page 7 of 7 

content organization and delivery based on smart instruct ... 

Keywords: XML, active learning, multiagent system, web-based education 

Results 1 - 20 of 200 Result page: 123456Z8910 next 

The ACM Portal is published by the Association for Computing Machinery. Copyright © 2005 ACM, Inc. 
Terms of Usage Privacy Policy Code of Ethics Contact Us 

Useful downloads: B Adobe Acrobat 0 QuickTime B Windows Media Player ^> Real Player 



http://portal.acm.org/results.cfm?coll=ACM&dl=ACM&CFID=62216407&CFTOKEN=721... 12/5/05 



Results (page 1): searching internet usage analyzer indexing data files folders images audi... Page 1 of 6 



A P0RTAL 



USPTO 



Subscribe (Full Service) Register (Limited Service, Free) Login 

Search: ® The ACM Digital Library C The Guide 
[searching internet usage analyzer indexing data files folders in] 



I Feedback Report a problem Satisfaction surve\ 

Four 

Terms used 27,6* 
searching internet usage analyzer indexing data files folders images audio files appointments email 

167,6! 



Sort results | re | evance p| fe save results to a Binder Try an Advanced Search 

by I «a Try this search in The A( 

— t ^ Search Tips 

expanded form ^ □ Open results in a new window 



ACM Guide 



Display 
results 



Result page: 123456Z8910 



Results 1 - 20 of 200 
Best 200 shown 

1 Experiments in social data mining: The TopicShop system 

Brian Amento, Loren Terveen, Will Hill, Deborah Hix, Robert Schulman 



next 

Relevance scale □ U H B I 



March 2003 ACM Transactions on Computer-Human Interaction (TOCHI), volume 10 issue l 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: pdf(377.92 KB) 



Social data mining systems enable people to share opinions and benefit from each other's 
experience. They do this by mining and redistributing information from computational 
records of social activity such as Usenet messages, system usage history, citations, or 
hyperlinks. Some general questions for evaluating such systems are: (1) is the extracted 
information valuable? and (2) do interfaces based on the information improve user task 
performance? We report here on TopicShop, a syst ... 

Keywords: Cocitation analysis, collaborative filtering, computer-supported cooperative 
work, information visualization, social filtering, social network analysis 



Client-server computing in mobile environments 

Jin Jing, Abdelsalam Sumi Helal, Ahmed Elmagarmid 

June 1999 ACM Computing Surveys (CSUR), volume 31 issue 2 

Publisher: ACM Press 

Full text available* fi3 odf(233 31 KB) Additional Information: full citation , abstract , references , citings , index 
k^ - * : terms , review 

Recent advances in wireless data networking and portable information appliances have 
engendered a new paradigm of computing, called mobile computing, in which users carrying 
portable devices have access to data and information services regardless of their physical 
location or movement behavior. In the meantime, research addressing information access in 
mobile environments has proliferated. In this survey, we provide a concrete framework and 
categorization of the various way ... 

Keywords: application adaptation, cache invalidation, caching, client/server, data 
dissemination, disconnected operation, mobile applications, mobile client/server, mobile 
compuing, mobile data, mobility awareness, survey, system application 
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Keywords: auditory user interface, digitized speech, interactive voice response, speech 
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Human interaction: Stuff I've seen: a system for personal information retrieval and re- g 
use 

Susan Dumais, Edward Cutrell, JJ Cadiz, Gavin Jancke, Raman Sarin, Daniel C. Robbins 
July 2003 Proceedings of the 26th annual international ACM SIGIR conference on 

Research and development in informaion retrieval 
Publisher: ACM Press 

Full text available- « pdff372.80 KB) AdditionaI Information: full citation , abstract, references , citings, index 
* leH*-* : terms 

Most information retrieval technologies are designed to facilitate information discovery. 
However, much knowledge work involves finding and re-using previously seen information. 
We describe the design and evaluation of a system, called Stuff I've Seen (SIS), that 
facilitates information re-use. This is accomplished in two ways. First, the system provides a 
unified index of information that a person has seen, whether it was seen as email, web page, 
document, appointment, etc. Second, becau ... 

Keywords: interactive information retrieval, personal information management, user 
interfaces, user studies 



m-links: An infrastructure for very small internet devices 
Bill N. Schilit, Jonathan Trevor, David M. Hilbert, Tzu Khiau Koh 

July 2001 Proceedings of the 7th annual international conference on Mobile computing 
and networking 

Publisher: ACM Press 

Full text available* 153 df(680 78 KB) Additional Information: full citation , abstract , references , citings , index 
™ terms 

In this paper we describe the Mobile Link (m-Links) infrastructure for utilizing existing World 
Wide Web content and services on wireless phones and other very small Internet terminals. 
Very small devices, typically with 3-20 lines of text, provide portability and other 
functionality while sacrificing usability as Internet terminals. In order to provide access on 
such limited hardware we propose a small device web navigation model that is more 
appropriate than the desktop computer's web brows ... 

Keywords: middleware, proxy, web phones, wireless, wireless web 
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Tim Leehane 

September 1996 Proceedings of the 24th annual ACM SIGUCCS conference on User 

services 
Publisher: ACM Press 

Full text available: Additional Information: 
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Fast detection of communication patterns in distributed executions 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Publisher: IBM Press 

Full text available: ^ pdf(4.21 MB) Additional Information: full citation , abstract , references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 

Pen computing: a technology overview and a vision 
Andre Meyer 

July 1995 ACM SIGCHI Bulletin, Volume 27 Issue 3 
Publisher: ACM Press 

Full text available: ^ pdf(5.14 MB) Additional Information: full citation , abstract , citings , index terms 

This work gives an overview of a new technology that is attracting growing interest in public 
as well as in the computer industry itself. The visible difference from other technologies is in 
the use of a pen or pencil as the primary means of interaction between a user and a 
machine, picking up the familiar pen and paper interface metaphor. From this follows a set 
of consequences that will be analyzed and put into context with other emerging technologies 
and visions. Starting with a short historic ... 

9 An analysis of XML database solutions for the management of MPEG-7 media 
4& descriptions 

^ Utz Westermann, Wolfgang Klas 

December 2003 ACM Computing Surveys (CSUR), volume 35 issue 4 
Publisher: ACM Press 

Full text available- fg| pdf(448 76 KB) Additional Information: full citation, abstract, references, index terms, 
. i^j-fcL-o review 

MPEG-7 constitutes a promising standard for the description of multimedia content. It can be 
expected that a lot of applications based on MPEG-7 media descriptions will be set up in the 
near future. Therefore, means for the adequate management of large amounts of MPEG-7- 
compliant media descriptions are certainly desirable. Essentially, MPEG-7 media descriptions 
are XML documents following media description schemes defined with a variant of XML 
Schema. Thus, it is reasonable to investigate curren ... 

Keywords: MPEG-7, XML database systems, multimedia databases 
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September 1999 interactions, Volume 6 issue 5 
Publisher: ACM Press 
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11 Model-driven development of Web applications: the AutoWeb system 
Piero Fraternali, Paolo Paolini 

October 2000 ACM Transactions on Information Systems (TOIS), Volume 18 issue 4 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 



Full text available: _ _ _ 

L - r ^ terms 

This paper describes a methodology for the development of WWW applications and a tool 
environment specifically tailored for the methodology. The methodology and the 
development environment are based upon models and techniques already used in the 
hypermedia, information systems, and software engineering fields, adapted and blended in 
an original mix. The foundation of the proposal is the conceptual design of WWW 
applications, using HDM-lite, a notation for the specification of structure, nav ... 

Keywords: HTML, WWW, application, development, intranet, modeling 

12 Safely executing untrusted code: Model-carrying code: a practical approach for safe |j 
^ execution of untrusted applications 

^ R. Sekar, V.N. Venkatakrishnan, Samik Basu, Sandeep Bhatkar, Daniel C. DuVarney 

October 2003 Proceedings of the nineteenth ACM symposium on Operating systems 

principles 
Publisher: ACM Press 

Full text available* pdf(301 30 KB) Additional Information: full citation , abstract , references , citings , index 

: terms 

This paper presents a new approach called model-carrying code (MCC) for safe execution of 
untrusted code. At the heart of MCC is the idea that untrusted code comes equipped with a 
concise high-level model of its security-relevant behavior. This model helps bridge the gap 
between high-level security policies and low-level binary code, thereby enabling analyses 
which would otherwise be impractical. For instance, users can use a fully automated 
verification procedure to determine if the code ... 

Keywords: mobile code security, policy enforcement, sand-boxing, security policies 

13 Service selection and metadata: Automating metadata generation: the simple indexing ^ 
interface 

Kris Cardinaels, Michael Meire, Erik Duval 

May 2005 Proceedings of the 14th international conference on World Wide Web 
Publisher: ACM Press 

Full text available: Q pdf(302.39 KB) Additional Information: full citation , abstract , references , index terms 

In this paper, we focus on the development of a framework for automatic metadata 
generation. The first step towards this framework is the definition of an Application 
Programmer Interface (API), which we call the Simple Indexing Interface (SII). The second 
step is the definition of a framework for implementation of the SII. Both steps are presented 
in some detail in this paper. We also report on empirical evaluation of the metadata that the 
SII and supporting framework generated in a real-life c ... 

Keywords: learning objects, metadata generation 
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Wendy Ju, Arna Ionescu, Lawrence Neeley, Terry Winograd 

November 2004 Proceedings of the 2004 ACM conference on Computer supported 
cooperative work 

Publisher: ACM Press 

Full text available: ^ pdf(1.31 MB) Additional Information: full citation , abstract , references , index terms 

We have built and tested WorkspaceNavigator, which supports knowledge capture and reuse 
for teams engaged in unstructured, dispersed, and prolonged collaborative design activity in 
a dedicated physical workspace. It provides a coherent unified interface for post-facto 
retrieval of multiple streams of data from the work environment, including overview 
snapshots of the workspace, screenshots of in-space computers, whiteboard images, and 
digital photos of physical objects. This paper describes t ... 

Keywords: collaborative design, knowledge capture/ re use, memory augmentation, physical 
environments, workspaces 
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16 Image Retrieval from the World Wide Web: Issues. Techniques, and Systems i 
M. L. Kherfi, D. Ziou, A. Bernardi 

March 2004 ACM Computing Surveys (CSUR), Volume 36 issue l 
Publisher: ACM Press 

Full text available: ^ pdf(294.13 KB) Additional Information: full citation , abstract , references , index terms 

With the explosive growth of the World Wide Web, the public is gaining access to massive 
amounts of information. However, locating needed and relevant information remains a 
difficult task, whether the information is textual or visual. Text search engines have existed 
for some years now and have achieved a certain degree of success. However, despite the 
large number of images available on the Web, image search engines are still rare. In this 
article, we show that in order to allow people to profi ... 

Keywords: Image-retrieval, World Wide Web, crawling, feature extraction and selection, 
indexing, relevance feedback, search, similarity 
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Shirley Browne, Jack Dongarra, Jeff Horner, Paul McMahan, Scott Wells 
May 1998 Proceedings of the third ACM conference on Digital libraries 
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Full text available: ^ pdf(1.14 MB) Additional Information: full citation , references , citings, index terms 
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19 Strategic directions in database systems — breaking out of the box 
Avi Silberschatz, Stan Zdonik 

December 1996 ACM Computing Surveys (CSUR), volume 28 issue 4 
Publisher: ACM Press 

Full text available: ^| pdf(222.64 KB) Additional Information: full citation , references , citings , index terms 



20 Web Behavior Patterns: How knowledge workers use the web 
Abigail J. Sellen, Rachel Murphy, Kate L. Shaw 

April 2002 Proceedings of the SIGCHI conference on Human factors in computing 

systems: Changing our world, changing ourselves 
Publisher: ACM Press 

Full text available- fifl pdf(425.34 KB) Additional Information: full citation , abstract , references , citings , index 
" terms 

We report on a diary study of how and why knowledge workers use the World Wide Web. By 
examining in detail a complete two-day set of Web activities from each of 24 people, we 
construct a framework with which to describe the different tasks knowledge workers 
undertake. By looking at the characteristics of each type of activity, we can see how certain 
activities are unsuited to particular kinds of technologies (e.g., mobile devices); how Web 
tools might be incrementally improved; and how we might ... 

Keywords: World Wide Web, appliances, diary study, knowledge workers, mobile 
technology, taxonomy 
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16 Deconstructing Commodity Storage Clusters Q 
Haryadi S. Gunawi, Nitin Agrawal, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, Jiri 
Schindler 

June 2005 Proceedings of the 32nd Annual International Symposium on Computer 
Architecture ISCA '05 

Publisher: IEEE Computer Society 

Full text available: ^ pdf(269.90 KB) Additional Information: full citation , abstract 

The traditional approach for characterizing complex systems is to run standard workloads 
and measure the resulting performance as seen by the end user. However, unique 
opportunities exist when characterizing a system that is itself constructed from 
standardized components: one can also look inside the system itself by instrumenting 
each of the components. In this paper, we show how intra-box instrumentation can help 
one understand the behavior of a large-scale storage cluster, the EMC Centera. I ... 

17 The Roma personal metadata service Q 
Edward Swierk, Emre Kiciman, Nathan C. Williams, Takashi Fukushima, Hideki Yoshida, Vince 
Laviano, Mary Baker 

October 2002 Mobile Networks and Applications, volume 7 issue 5 
Publisher: Kluwer Academic Publishers 

Full text available: ^ pdf(221.38 KB) Additional Information: full citation , abstract , references , index terms 

People now have available to them a diversity of digital storage facilities, including 
laptops, cell phone address books, handheld devices, desktop computers and web-based 
storage services. Unfortunately, as the number of personal data repositories increases, so 
does the management problem of ensuring that the most up-to-date version of any 
document in a user's personal file space is available to him on the storage facility he is 
currently using. We introduce the Roma personal metadata service t ... 

Keywords: data synchronization, distributed data storage, distributed databases, 
metadata, mobile computing, personal systems 



18 A Metadat a Catalog Service for Data Intensive Applications Q 
Gurmeet Singh, Shishir Bharathi, Ann Chervenak, Ewa Deelman, Carl Kesselman, Mary 
Manohar, Sonal Patil, Laura Pearl man 

November 2003 Proceedings of the 2003 ACM/IEEE conference on Supercomputing 
Publisher: IEEE Computer Society 

Full text available: *g| pdf(1 78.25 KB) Additional Information: full citation , abstract 

Advances in computational, storage and network technologies as well as middle ware such 
as the Globus Toolkit allow scientists to expand the sophistication and scope of data- 
intensive applications. These applications produce and analyze terabytes and petabytes of 
data that are distributed in millions of files or objects. To manage these large data sets 
efficiently, metadata or descriptive information about the data needs to be managed. 
There are various types of metadata, and it is likely that a ... 

19 Conceptual modeling and metadata: Grid metadata catalog service-based OGC web Q 
<g> registry service 

^ Peisheng Zhao, Aijun Chen, Yang Liu, Liping Di, Wenli Yang, Peichuan Li 

November 2004 Proceedings of the 12th annual ACM international workshop on 

Geographic information systems 
Publisher: ACM Press 

Full text available: *Q pdf(1 28.43 KB) Additional Information: full citation , abstract , references , index terms 

Grid is a promising e-Science infrastructure that promotes and facilitates the sharing and 
collaboration in the use of distributed heterogeneous resources through Virtual 
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Organization (VO). A critical factor to the overall utility of Grid is a scalable, flexible and 
robust registry mechanism. Although it provides some mechanisms to store and access 
metadata for publishing and discovering resources, such as MCS (Metadata Catalog 
Service), the Grid registry is inadequate for dealing with domain ... 

Keywords: OGC, OWL, catalog, grid, information model, ontology, semantic 



20 Serverless network file systems 

Thomas E. Anderson, Michael D. Dahlin, Jeanna M. Neefe, David A. Patterson, Drew S. 
Roselli, Randolph Y. Wang 

February 1996 ACM Transactions on Computer Systems (TOCS), Volume 14 issue i 
Publisher: ACM Press 

Full text available* ti P p df(2.69 MB) Additional Information: full citation , abstract , references , citings, index 
" terms 

We propose a new paradigm for network file system design: serverless network file 
systems. While traditional network file systems rely on a central server machine, a 
serverless system utilizes workstations cooperating as peers to provide all file system 
services. Any machine in the system can store, cache, or control any block of data. Our 
approach uses this location independence, in combination with fast local area networks, to 
provide better performance and scalability th ... 

Keywords: RAID, log cleaning, log structured, log-based striping, logging, redundant 
data storage, scalable performance 
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1 Concurrent search structure algorithms | 
Dennis Shasha, Nathan Goodman 

March 1988 ACM Transactions on Database Systems (TODS), Volume 13 issue l 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms , review 

A dictionary is an abstract data type supporting the actions member, insert, and delete. A 
search structure is a data structure used to implement a dictionary. Examples include B 
trees, hash structures, and unordered lists. Concurrent algorithms on search structures 
can achieve more parallelism than standard concurrency control methods would suggest, 
by exploiting the fact that many different search structure states represent one dictionary 
state. We present a framework for verifying such a ... 

2 Searching for deadlocks while debugg in g concurrent haskell programs j 
Jan Christiansen, Frank Huch 

September 2004 ACM SIGPLAN Notices , Proceedings of the ninth ACM SIGPLAN 

international conference on Functional programming ICFP '04, volume 
39 Issue 9 
Publisher: ACM Press 

Full text available: ^j? ) pdf(1 25.75 KB) Additional Information: full citation , abstract , references , index terms 

This paper presents an approach to searching for deadlocks in Concurrent Haskell 
programs. The search is based on a redefinition of the IO monad which allows the reversal 
of Concurrent Haskells concurrency primitives. Hence, it is possible to implement this 
search by a backtracking algorithm checking all possible schedules of the system. It is 
integrated in the Concurrent Haskell Debugger (CHD), and automatically searches for 
deadlocks in the background while debugging. The tool is easy to use a ... 



Keywords: concurrent haskell, deadlock, debugging, detecting deadlocks 



3 Peer-to-peer systems for prefix search 

Baruch Awerbuch, Christian Scheideler 
v July 2003 Proceedings of the twenty-second annual symposium on Principles of 
distributed computing 

Publisher: ACM Press 

Full text available: fgl pdf(944.28 KB) Additional Information: full citation , abstract , references , citings, index 
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terms 

This paper presents a general methodology for building message-passing peer-to-peer 
systems capable of performing prefix search for arbitrary user-defined names. Our 
methodology allows to achieve even load distribution, high fault-tolerance, and low- 
congestion concurrent query execution. This is the, first known peer-to-peer system for 
prefix search with such properties. The essence of this methodology is a plug and play 
paradigm for designing a peer-to-peer system as a modular composition of ar ... 

Poster papers: What's the code?: automatic classification of source code archives 
Secil Ugurel, Robert Krovetz, C. Lee Giles 

July 2002 Proceedings of the eighth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Publisher: ACM Press 

Full text available: | ^pdft759.11 KB) Additional Information: full citation , abstract , references , index terms 

There are various source code archives on the World Wide Web. These archives are 
usually organized by application categories and programming languages. However, 
manually organizing source code repositories is not a trivial task since they grow rapidly 
and are very large (on the order of terabytes). We demonstrate machine learning 
methods for automatic classification of archived source code into eleven application topics 
and ten programming languages. For topical classification, we concentrate on ... 

Locking without blocking: making lock based concurrent data structure algorithms 
nonblocking 

John Turek, Dennis Shasha, Sundeep Prakash 

July 1992 Proceedings of the eleventh ACM SIGACT-SIGMOD-SIGART symposium on 
Principles of database systems 

Publisher: ACM Press 

Full text available- I flpdffLQg MB) Additional Information: full citation, abstract, references, citings, index 
' terms 

Nonblocking algorithms for concurrent data structures guarantee that a data structure is 
always accessible. This is in contrast to blocking algorithms in which a slow or halted 
process can render part or all of the data structure inaccessible to other processes. This 
paper proposes a technique that can convert most existing lock-based blocking data 
structure algorithms into nonblocking algorithms with the same functionality. Our 
instruction-by-instruction transformation can be ap ... 

Tools and approaches for developing data-intensive Web applications: a survey 
Piero Fraternali 

September 1999 ACM Computing Surveys (CSUR), Volume 31 issue 3 
Publisher: ACM Press 

Full text available* f 3 pdf(524 80 KB) Add 't'°nal Information: full citation , abstract , references , citings , index 

terms 

The exponential growth and capillar diffusion of the Web are nurturing a novel generation 
of applications, characterized by a direct business-to-customer relationship. The 
development of such applications is a hybrid between traditional IS development and 
Hypermedia authoring, and challenges the existing tools and approaches for software 
production. This paper investigates the current situation of Web development tools, both 
in the commercial and research fields, by identifying and characte ... 

Keywords: HTML, Intranet, WWW, application, development 
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Stefan Savage, Michael Burrows, Greg Nelson, Patrick Sobalvarro, Thomas Anderson 
November 1997 ACM Transactions on Computer Systems (TOCS), Volume 15 issue 4 

Publisher: ACM Press 

Full text available* fiQ pdfd 36.04 KB) Additional Information: full citation , abstract , references , citings , index 
' ^^-^ : terms 

Multithreaded programming is difficult and error prone. It is easy to make a mistake in 
synchronization that produces a data race, yet it can be extremely hard to locate this 
mistake during debugging. This article describes a new tool, called Eraser, for dynamically 
detecting data races in lock-based multithreaded programs. Eraser uses binary rewriting 
techniques to monitor every shared-monory reference and verify that consistent locking 
behavior is observed. We present several case studies ... 

Keywords: binary code modification, multithreaded programming, race detection 



8 Eraser: a dynamic data race detector for multi-threaded programs 

Stefan Savage, Michael Burrows, Greg Nelson, Patrick Sobalvarro, Thomas Anderson 
October 1997 ACM SIGOPS Operating Systems Review , Proceedings of the sixteenth 
ACM symposium on Operating systems principles SOSP '97, volume 31 issue 
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Publisher: ACM Press 

Full text available: ^pdf(1.51 MB) Additional Information: full citation , references , citings , index terms 




WSQ/DSQ: a practical approach for combined querying of databases and the Web 
Roy Goldman, Jennifer Widom 

May 2000 ACM SIGMOD Record , Proceedings of the 2000 ACM SIGMOD international 

conference on Management of data SIGMOD '00, Volume 29 issue 2 
Publisher: ACM Press 

Full text available - f*l odf(223 65 KB) Ad ° , iti° na l Information: full citation , abstract , references , citings , index 
: terms 

We present WSQ/DSQ (pronounced "wisk-disk"), a new approach for combining the query 
facilities of traditional databases with existing search engines on the Web. WSQ, for Web- 
Supported (Database) Queries, leverages results from Web searches to enhance SQL 
queries over a relational database. DSQ, for Database -Supported (Web) Queries, uses 
information stored in the database to enhance and explain Web searches. This paper 
focuses primarily on WSQ, describing a simple, lo ... 

10 Evaluating top-/c queries over web-accessible databases 
yjfik Amelie Marian, Nicolas Bruno, Luis Gravano 

>^ June 2004 ACM Transactions on Database Systems (TODS), Volume 29 issue 2 
Publisher: ACM Press 

Full text available: ^ pdf(1.03MB) Additional Information: full citation , abstract , references , index terms 

A query to a web search engine usually consists of a list of keywords, to which the search 
engine responds with the best or "top" k pages for the query. This top-/c query model is 
prevalent over multimedia collections in general, but also over plain relational data for 
certain applications. For example, consider a relation with information on available 
restaurants, including their location, price range for one diner, and overaii food rating. A 
user who queries such a relation might ... 

Keywords: Parallel query processing, query optimization, top-/c query processing, web 
databases. 
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11 Hypercube data analysis in astronomy: optical interferometry and millisecond pulsar Q 
searches 

P. Gorham, T. Prince, S. Anderson 

January 1989 Proceedings of the third conference on Hypercube concurrent 
computers and applications - Volume 2 

Publisher: ACM Press 

Full text available: ffl pdf(524.45 KB) Additiona ' Information: full citation , abstract, references , citings, index 
^ terms 

Astronomical data sets are beginning to live up to their name, in both their sizes and the 
complexity of the analysis required. Here we discuss two astronomical data analysis 
problems which we have begun to implement on a hypercube concurrent processor 
environment: The intensive image processing required in an optical interferometry 
project, and the large scale power spectral analysis required by a search for millisecond- 
period radio pulsars. In both cases the analysis proceeds largely in t ... 




12 Algorithms and data structures: Concurrent cache-oblivious b-trees 
Michael A. Bender, Jeremy T. Fineman, Seth Gilbert, Bradley C Kuszmaul 
July 2005 Proceedings of the 17th annual ACM symposium on Parallelism in 

algorithms and architectures SPAA'05 
Publisher: ACM Press 

Full text available: ^pdf(1 80.51 KB) Additional Information: full citation , abstract , references , index terms 

This paper presents concurrent cache-oblivious (CO) B-trees. We extend the cache- 
oblivious model to a parallel or distributed setting and present three concurrent CO En- 
trees. Our first data structure is a concurrent lock-based exponential CO B-tree. This data 
structure supports insertions and non-blocking searches/successor queries. The second 
and third data structures are lock-based and lock-free variations, respectively, on the 
packed-memory CO B-tree. These data structures support range queri ... 

Keywords: cache-oblivious b-tree, concurrent b-tree, exponential tree, lock free, non- 
blocking, packed-memory array 





13 Semantic analysis in a concurrent compiler 

V. Seshadri, S. Weber, D. B. Wortman, C. P. Yu, I. Small 
V June 1988 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1988 conference 
on Programming Language design and Implementation PLDI '88, volume 23 
Issue 7 
Publisher: ACM Press 

Full text available* fiE ) pdf(805 25 KB) Additional Information: full citation , abstract , references , citings , index 

terms 

Traditional compilers are usually sequential programs that serially process source 
programs through lexical analysis, syntax analysis, semantic analysis and code 
generation. The availability of multiprocessor computers has made it feasible to consider 
alternatives to this serial compilation process. The authors are currently engaged in a 
project to devise ways of structuring compilers so that they can take advantage of 
modern multiprocessor hardware. This paper is about the most difficult a ... 

14 A concurrent compiler for Modula-2+ 
David B. Wortman, Michael D. Junkin 

July 1992 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1992 conference 
on Programming language design and implementation PLDI '92, volume 27 

Issue 7 
Publisher: ACM Press 

Full text available: Q pdfd.21 MB) Additional Information: full citation , abstract , index terms 
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In this paper we describe a collection of techniques for the design and implementation of 
concurrent compilers. We begin by describing a technique for dividing a source program 
into many streams so that each stream can be compiled concurrently. We discuss several 
compiler design issues unique to concurrent compilers including source program 
partitioning, symbol table management, compiler task scheduling and information flow 
constraints. The application of our techniques is ... 

15 Database systems: achievements and opportunities 
October 1991 Communications of the ACM, Volume 34 issue 10 
Publisher: ACM Press 

Full text available: fjfl pdf(4.03 MB) Additional Information: full citation , citings , index terms 



16 Session 4A: Family trees: an ordered dictionary with optimal congestion, locality. 

de gree, and search time 

Kevin C. Zatloukal, Nicholas J. A. Harvey 

January 2004 Proceedings of the fifteenth annual ACM-SIAM symposium on Discrete 
algorithms 

Publisher: Society for Industrial and Applied Mathematics 

Full text available: |^ pdf(314.06 KB) Additional Information: full citation , abstract , references 

We consider the problem of storing an ordered dictionary data structure over a distributed 
set of nodes. In contrast to traditional sequential data structures, distributed data 
structures should ideally have low congestion. We present a novel randomized data 
structure, called a family tree, to solve this problem. A family tree has optimal expected 
congestion, uses only a constant amount of state per node, and supports searches and 
node insertion/deletion in expected O(log n) ... 

17 Parallel execution of prolog programs: a survey 
Gopal Gupta, Enrico Pontelli, Khayri A.M. AM, Mats Carlsson, Manuel V. Hermenegildo 
July 2001 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 23 Issue 4 
Publisher: ACM Press 

Full text available- 155 odfd 95 MB) AdditionaI Information: full citation , abstract , references , citings, index 
' l£H*-^ terms 

Since the early days of logic programming, researchers in the field realized the potential 
for exploitation of parallelism present in the execution of logic programs. Their high-level 
nature, the presence of nondeterminism, and their referential transparency, among other 
characteristics, make logic programs interesting candidates for obtaining speedups 
through parallel execution. At the same time, the fact that the typical applications of logic 
programming frequently involve irregular computatio ... 

Keywords: Automatic parallelization, constraint programming, logic programming, 
parallelism, prolog 
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Ehud Shapiro 

September 1989 ACM Computing Surveys (CSUR), Volume 21 issue 3 
Publisher: ACM Press 

Full text available* fi 3 pdf(9.62 MB) Additional Information: full citation , abstract , references , citings , index 
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Concurrent logic languages are high-level programming languages for parallel and 
distributed systems that offer a wide range of both known and novel concurrent 
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programming techniques. Being logic programming languages, they preserve many 
advantages of the abstract logic programming model, including the logical reading of 
programs and computations, the convenience of representing data structures with logical 
terms and manipulating them using unification, and the amenability to 
metaprogrammin ... 

19 Ada debugging and testing support environments 
Richard E. Fairley 

November 1980 ACM SIGPLAN Notices , Proceeding of the ACM-SIGPLAN symposium 

on Ada programming language SIGPLAN '80, Volume 15 issue n 
Publisher: ACM Press 

Full text available: ^ pdf(975.77 KB) Additional Information: full citation , references , citings 



20 Supporting semantic information retrieval in communication networks by multimedia Q 

^ techniques 

^ Annelise Mark Pejtersen 

March 1995 ACM SIGIR Forum, Volume 29 Issue 1 

Publisher: ACM Press 

Full text available: ^ pdf(795.98 KB) Additional Information: full citation , abstract , index terms 

The aim of the project is to understand and describe the information needs and search 
behavior of a professional design team as a basis for formulation of specifications of an 
information system that effectively supports the access to a wide network of 
heterogeneous databases. This project will supplement a similar project focused on the 
needs of casual users in public libraries and thus serve to generalize from previous 
research. To limit the scope of the study to a realistic project, it will b ... 
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1 Extending a relational database with deferred referential integrity checking and 
<g> intelligent joins 

^ Stephanie Cammarata, Prasadram Ramachandra, Darrell Shane 

June 1989 ACM SIGMOD Record , Proceedings of the 1989 ACM SIGMOD international 

conference on Management of data SIGMOD '89, Volume 18 issue 2 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citing s, index 
terms 



Full text available ^ pdf(1. 18 MB) 



Interactive use of relational database management systems (DBMS) requires a user to be 
knowledgeable about the semantics of the application represented in the database. In 
many cases, however, users are not trained in the application field and are not DBMS 
experts. Two categories of functionality are problematic for such users: (1) updating a 
database without violating integrity constraints imposed by the domain and (2) using join 
operations to retrieve data from more than one relation. We ... 

2 IRON file systems i 
Vijayan Prabhakaran, Lakshmi N. Bairavasundaram, Nitin Agrawal, Haryadi S. Gunawi, 
Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau 

October 2005 ACM SIGOPS Operating Systems Review , Proceedings of the twentieth 
ACM symposium on Operating systems principles SOSP '05, Volume 39 issue 

5 

Publisher: ACM Press 

Full text available: ^ pdf(323.82 KB) Additional Information: full citation , abstract , references , index terms 

Commodity file systems trust disks to either work or fail completely, yet modern disks 
exhibit more complex failure modes. We suggest a new fail-partial failure model for disks, 
which incorporates realistic localized faults such as latent sector errors and block 
corruption. We then develop and apply a novel failure-policy fingerprinting framework, to 
investigate how commodity file systems react to a range of more realistic disk failures. 
We classify their failure policies in a new ... 

Keywords: IRON file systems, block corruption, disks, fail-partial failure model, fault 
tolerance, internal, latent sector errors, redundancy, reliability, storage 



3 An introduction to data warehousing: what are the implications for the network? 
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Full text available: ^pdfd 45.35 KB) Additional Information: full citation , abstract , references , index terms 

Data warehousing is an information systems environment, rather than a product. It has 
emerged as an essential business entity for sophisticated analysis of data. This article 
presents a clear overview of the implications of data warehousing for business. © 1998 
John Wiley & Sons, Ltd. 

The Panasas ActiveScale Storage Cluster - Delivering Scalable High Bandwidth 
Storage 

Hong Tang, Aziz Gulbeden, Jingyu Zhou, William Strathearn, Tao Yang, Lingkun Chu 
November 2004 Proceedings of the 2004 ACM/IEE E conference on Supercom puting 
Publisher: IEEE Computer Society 

Full text available: QpdfM 99.24 KB) Additional Information: full citation , abstract 

Fundamental advances in high-level storage architectures and low-level storage-device 
interfaces greatly improve the performance and scalability of storage systems. 
Specifically, the decoupling of storage control (i.e., file system policy) from datapath 
operations (i.e., read, write) allows client applications to leverage the readily available 
bandwidth of storage devices while continuing to rely on the rich semantics of todayys file 
systems. Further, the evolution of storage interfaces from bio ... 

Microsoft TerraServer: a spatial data warehouse 
Tom Barclay, Jim Gray, Don Slutz 

May 2000 ACM SIGMOD Record , Proceedings of the 2000 ACM SIGMOD international 

conference on Management of data SIGMOD '00, Volume 29 issue 2 
Publisher: ACM Press 

Full text available: fiQ pdf(410.74 KB) Additional Information: full citation , abstract , references , citings, index 
^ ! terms 

Microsoft® TerraServer stores aerial, satellite, and topographic images of the earth in a 
SQL database available via the Internet. It is the world's largest online atlas, combining 
eight terabytes of image data from the United States Geological Survey (USGS) and 
SPIN-2. Internet browsers provide intuitive spatial and text interfaces to the data. Users 
need no special hardware, software, or knowledge to locate and browse imagery. This 
paper describes how terabytes of "Internet unfrie ... 

Keywords: VLDB, geo-spatial, image databases, internet 
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Alefiya Hussain, Genevieve Bartlett, Yuri Pryadkin, John Heidemann, Christos Papadopoulos, 
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August 2005 Proceeding of the 2005 ACM SIGCOMM workshop on Mining network 
data MineNet '05 

Publisher: ACM Press 
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One of the most pressing problems in network research is the lack of long-term trace data 
from ISPs. The Internet carries an enormous volume and variety of data; mining this data 
can provide valuable insight into the design and development of new protocols and 
applications. Although capture cards for high-speed links exist today, actually making the 
network traffic available for analysis involves more than just getting the packets off the 
wire, but also handling large and variable traffic ... 
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The SkyServer provides Internet access to the public Sloan Digital Sky Survey (SDSS) 
data for both astronomers and for science education. This paper describes the SkyServer 
goals and architecture. It also describes our experience operating the SkyServer on the 
Internet. The SDSS data is public and well-documented so it makes a good test platform 
for research on database algorithms and performance. 

NSF workshop on industrial/academic cooperation in database systems jjjjj 
Mike Carey, Len Seligman 

March 1999 ACM SIGMOD Record, Volume 28 issue l 
Publisher: ACM Press 

Full text available: -fg?) pdf(1 .96 MB) Additional Information: full citation , index terms 



An overview of data warehousing and OLAP technology 
Surajit Chaudhuri, Umeshwar Dayal 
March 1997 ACM SIGMOD Record, Volume 26 issue l 

Publisher: ACM Press 

Full text available: ^g| pdf(101.60 KB) Additional Information: full citation , abstract , citings , index terms 

Data warehousing and on-line analytical processing (OLAP) are essential elements of 
decision support, which has increasingly become a focus of the database industry. Many 
commercial products and services are now available, and all of the principal database 
management system vendors now have offerings in these areas. Decision support places 
some rather different requirements on database technology compared to traditional on- 
line transaction processing applications. This paper provides an overview ... 

10 Database research: achievements and o p portunities into the 1st century 
Avi Silberschatz, Mike Stonebraker, Jeff Ullman 
March 1996 ACM SIGMOD Record, Volume 25 issue l 
Publisher: ACM Press 
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Full text available* Ddf(147 90 KB) Addit ' onaJ Information: full citation , abstract , references , citings , index 
T^-P 2 — * terms 

Metadata updates, such as file creation and block allocation, have consistently been 
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identified as a source of performance, integrity, security, and availability problems for file 
systems. Soft updates is an implementation technique for low-cost sequencing of fine- 
grained updates to write-back cache blocks. Using soft updates to track and enforce 
metadata update dependencies, a file system can safely use delayed writes for almost all 
file operations. This article describes soft ... 

12 Dynamic Metadata Management for Petabyte-Scale File Systems Q 
Sage A. Weil, Kristal T. Pollack, Scott A. Brandt, Ethan L Miller 

November 2004 Proceedings of the 2004 ACM/IEEE conference on Supercomputing 
Publisher: IEEE Computer Society 

Full text available: Q pdf(1 75.04 KB) Additional Information: full citation , abstract 

In petabyte-scale distributed file systems that decouple read and write from metadata 
operations, behavior of the metadata server cluster will be critical to overall system 
performance and scalability. We present a dynamic subtree partitioning and adaptive 
metadata management system designed to efficiently manage hierarchical metadata 
workloads that evolve over time. We examine the relative merits of our approach in the 
context of traditional workload partitioning strategies, and demonstrate the ... 



13 Research problems in data warehousing 
Jennifer Widom 

December 1995 Proceedings of the fourth international conference on Information 
and knowledge management 

Publisher: ACM Press 
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14 Reading file metadata with extract and libextractor Q 
Christian Grothoff 

June 2005 Linux Journal, Volume 2005 issue 134 
Publisher: Specialized Systems Consultants, Inc. 

Full text available: jS] html(14.86 KB) Additional Information: full citation , abstract , index terms 

Where are the 400x200 PNG images I worked on in March? This system offers the 
answer. 

15 Educating software engineering students to manage risk Q 
Barry Boehm, Daniel Port 

July 2001 Proceedings of the 23rd International Conference on Software 
Engineering 

Publisher: IEEE Computer Society 

Full text available: Wi pdf(1 10.12 KB) 

J| Additional Information: full citation , abstract , references , index terms 

^ Publisher Site 

In 1996, USC switched its core two-semester software engineering course from a 
hypothetical-project, homework-and-exam course based on the Bloom taxonomy of 
educational objectives (knowledge, comprehension, application, analysis, synthesis, 
evaluation). The revised course is a real-client team-project course based on the CRESST 
model of learning objectives (content understanding, problem solving, collaboration, 
communication, and self-regulation). We used the CRESST cognitive demands analysis ... 

Keywords: process models, product models, project courses, property models, risk 
management, software engineering education, success models 
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